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Abstract 


The Adaptive Seeding problem is an algorithmic challenge motivated by influence maximiza¬ 
tion in social networks: One seeks to select among certain accessible nodes in a network, and 
then select, adaptively, among neighbors of those nodes as they become accessible in order to 
maximize a global objective function. More generally, adaptive seeding is a stochastic optimiza¬ 
tion framework where the choices in the first stage affect the realizations in the second stage, 
over which we aim to optimize. 

Our main result is a (1 —l/e)^-approximation for the adaptive seeding problem for any mono¬ 
tone submodular function. While adaptive policies are often approximated via non-adaptive poli¬ 
cies, our algorithm is based on a novel method we call locally-adaptive policies. These policies 
combine a non-adaptive global structure, with local adaptive optimizations. This method en¬ 
ables the (1 — 1/e) ^-approximation for general monotone submodular functions and circumvents 
some of the impossibilities associated with non-adaptive policies. 

We also introduce a fundamental problem in submodular optimization that may be of inde¬ 
pendent interest: given a ground set of elements where every element appears with some small 
probability, find a set of expected size at most k that has the highest expected value over the 
realization of the elements. We show a surprising result: there are classes of monotone submodu¬ 
lar functions (including coverage) that can be approximated almost optimally as the probability 
vanishes. For general monotone submodular functions we show via a reduction from Planted- 
Clique that approximations for this problem are not likely to be obtainable. This optimization 
problem is an important tool for adaptive seeding via non-adaptive policies, and its hardness 
motivates the introduction of locally-adaptive policies we use in the main result. 


1 Introduction 


The surge of massive digital records of human interactions in recent years provides a new system- 
wide perspective on social networks. In addition to observing and predicting patterns of collective 
human behavior, in many cases the dynamics of the network can be engineered. One such example is 
when attempting to initiate a large cascade by seeding it at certain important nodes in the network 
to promote a product or social movement through word-of-mouth. The algorithmic challenge of 
selecting individuals who can serve as early adopters of a new idea, product, or technology in a 
manner that will trigger a large cascade in the social network is known as influence maximization. 
Since it was first posed by Domingos and Richardson [HIM! and elegantly formulated and further 
developed by Kempe, Kleinberg, and Tardos [22], a broad range of algorithmic methods have been 
developed for this canonical problem m EllHl ttZllSnilSTlllHlEII . 

In many applications of influence maximization, despite having full knowledge of the network, 
one may only have access to a small slice of the network. In marketing applications for example, 
companies often reward influential users who visit their online store, or who have engaged with them 
in other ways (subscribe to a mailing list, follow the brand, install an application etc.). If we think of 
users who arrive at a store or follow a brand as being randomly sampled from the network, observing 
high-degree users is a rare event. This is simply due to the heavy-tailed degree distributions of social 
networks. Since influence maximization techniques are based on selecting high degree users (not 
necessarily the highest degree), their application on such samples can become ineffective. In general, 
access to high degree individuals in social networks is often rare, which raises the following question. 

Is it possible to design effective influence maximization strategies despite the rarity of influencers? 

To tackle the problem or rare influencers, the adaptive seeding framework was recently developed 
in |36j . The framework is a stochastic optimization model which formalizes an intuitive approach: 
rather than spend the entire budget on the non-influential users, we can spend a fraction of the 
budget on the accessible users, wait for their friends to appear as a result, and optimize influence 
by using the remaining budget to seed influential friends. The idea is to leverage what’s known as 
the friendship paradox [T5], which suggests that although people are not likely to be influential, 
they are likely to know someone who is. In |23| it was shown that in well-established mathematical 
models of social networks there are asymptotic gaps between the degree of a random node and its 
neighbor. This structural property implies that dramatic improvements to influence maximization 
are indeed achievable by optimizing influence over friends. In recent work [20|, along with scalable 
algorithms for this problem, it was shown through various experiments on the Facebook graph 
that dramatic improvements to naive application of influence maximization are indeed obtainable 
through adaptive seeding. 

The adaptive seeding model. The adaptive seeding model is a two-stage stochastic optimiza¬ 
tion framework. We are given a set of nodes X, their set of neighbors Afi^X), each associated with 
a probability p*, as well as a budget k G N and a function / : 2^^^^ —)• M. In the first stage, a set 
Sex can be selected, which causes each one of its neighbors to materialize independently with 
probability p*. In the second stage, the remainder of the budget can be used to optimize the function 
/(•) over the realized neighbors. This function quantifies the expected number of individuals in the 
network that will be influenced as a result of selecting a subset of early adopters. The goal is to 
select a subset 5 C X of size at most k s.t. the function can be optimized in expectation over all 
possible realizations of its neighbors with the remaining budget k — |5|. Equivalently, our goal is 
to select sex s.t. in expectation over all possible realizations Ri,..., Rm of M{X) the value of 
a set of its neighbors Tj of size /c — |5| that appears in the realization Ri is optimal. 
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Stated in these terms the objective is: 

m 

~ 2=1 

TiC Rif] M{S) Vi G [m] 

I'S'I + \Ti\ < ^ Vi G [m] 

The adaptive seeding formulation models the challenge of recruiting neighbors who can become 
effective influencers using the budget, rather than trying to influence them to forward the informa¬ 
tion without incentives as in the standard influence maximization framework^ Although it seems 
quite plausible that the probabilities of attracting neighbors could depend on the rewards they 
receive, the simplification assuming unit costs is deliberate. This simplihcation is consistent with 
the celebrated Kempe-Kleinberg-Tardos model [22], and can be extended to the case where nodes 
take on different costs m- 

While the motivation is influence maximization, adaptive seeding is a versatile framework of 
stochastic optimization (see related work section for further discussion on stochastic optimization). 
At its essence, it formalizes the following question. 

Given a distribution on the consequences of the actions we take in the present, can we design 

algorithms that optimize events in the future? 

The main question when considering stochastic optimization models is whether the same guar¬ 
antees as standard optimization can be obtained |38] . It is not hard to show that for very simple 
objective functions such as f{S) = IS"!, the two-stage optimization is NP-hard even in the non¬ 
stochastic case, i.e. when all probabilities are one. Thus, approximation is needed. Therefore, in the 
context of adaptive seeding the question is whether an objective that can be well approximated in 
standard optimization can also be well approximated in adaptive seeding. 

Submodularity. A function / : 2^ —M+ is submodular if f{S U T) < f{S) + f{T) — f{S n T). 
Equivalently, a function is submodular if it has a natural diminishing returns property: for any 
S' C T C and a G N\T a function is submodular if / 5 (a) > fria), where fA{B) = f{A[JB) — f{A) 
for any A,B ON. Unless otherwise stated, we will assume the function is normalized (/(0) = 0) and 
monotone {SOT implies /(S) < f{T)). In standard optimization submodularity is a certificate 
for desirable approximation guarantees ( 1 - 1 /e approximation for maximizing such functions under 
a matroid constraint m), and a slightly broader class known as fractionally subadditive functions 
already exhibits strong information-theoretic lower bounds |29j . 

Adaptive Seeding of submodular functions. The effectiveness of adaptive seeding depends 
crucially on our ability to optimize general classes of submodular functions. Say that a function 
can be adaptively seeded if the adaptive seeding problem can be approximated within a constant 
factor for this objective. The main result in |36j shows that any function in a class known as the 
triggering model — a special class of monotone submodular functions defined in the seminal work 
of Kempe, Kleinberg, and Tardos |2^ — can be adaptively seeded. But far more general models of 
submodular functions are used to describe influence in social networks I 22 IEI], and the techniques 

^Having probabilities on the nodes in Af{X) and not the edges implies that the neighbors are not more likely to 
appear if they have more parents in X selected by the algorithm. This corresponds to what’s known in microeconomics 
as a standard Bayesian utility model with no externalities. For submodular functions, this problem can be shown to 
be equivalent to the one where probabilities appear on the edges and not the nodes. This corresponds to a model 
where nodes in the second stage are influenced by nodes in the hrst stage through the independent cascade model |22| . 
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of [36] cannot be applied to these (see related work section for further discussion on submodular 
function, their connection with the influence maximization problem and the limitation of previous 
techniques for adaptive seeding). Thus the main question in this work is: 

Can any submodular function he adaptively seeded? 

Naturally, we will seek algorithms that obtain the best approximation ratio possible. The main 
challenge would be to obtain an algorithm that achieves a (1 — 1/e)^ approximation ratio for general 
monotone submodular functions. This bound is a natural goal for this problem, as we discuss below. 

1.1 Warm up: the non-stochastic case 

We begin by considering the non-stochastic version of the problem. That is, the version in which 
every node in the set of neighbors appears with probability one. Here we are given a set X and 
its neighbors Af{X), there is a monotone submodular function dehned on J\f{X), and the goal is 
to select t < k elements in X connected to a set of size at most k — t in AA(X) for which the 
submodular function has the largest value. A trivial solution would be to run a greedy algorithm 
with budget of k/2 on parent-child pairs. That is, at every stage select the node in M{X) whose 
marginal contribution given the nodes already selected is the largest, and add one of its parents 
(if none have been added in previous rounds) to ensure the solution is feasible. Since we need at 
most k/2 parents to select k/2 children the solution is feasible, and submodularity guarantees the 
solution’s value is at least half of the value as if we were to run the algorithm with a budget of 
k, which is an upper bound on the optimal solution. It is well known that the greedy algorithm 
is a 1 — 1/e-approximation to the optimal solution, and hence this trivial algorithm would be a 
(1 — 1/e)/2-approximation. 


Optimal approximation via e-blocks. A natural extension to the above approach would be 
to pair parent nodes (i.e. nodes in X) with subsets of their children. We call such pairs blocks, and 
the density of a block is simply the ratio between its marginal contribution and its size: for a set 
of children T C J\f{X), the marginal density of a block (x, B) with respect to T is 

fT{B)/{l + \B\). 


Ideally, for each parent we would add the subset of children which makes for the densest block, as 
one can then show that an algorithm which iteratively adds the densest block results in the optimal 
1 — 1/e approximation. However, even for coverage functions finding the densest block implies 
solving an NP-hard problem. Instead of densest blocks we can consider using e-blocks: a node in 
X € A and a subset of its neighbors of size at most 1/e. Note that for any constant e > 0, finding 
the densest e-block can be done in polynomial time by brute forcing all subsets of 1/e neighbors of 
every node x £ X and computing their value. Importantly, one can show that for any e > 0 the 
densest e-block H is a (1 — e) approximation to the densest block O if |0| is larger than 1/e: 


MB) 

l + l^l 




MO) 
i + |o| 


where the first inequality is by submodularity and taking the 1 /e elements of O with highest value. 

The algorithm is now simple: until exhausting the budget, add the densest e-block to the solu¬ 
tion. For any e > 0, this algorithm is a (1 — 1/e — e) approximation, and the idea of the analysis 
is natural; at every stage this algorithm selects an e-block that is a (1 — e) approximation to the 
densest block, and by applying a standard inductive argument one can show that this results in a 
(1 — 1/e — e) approximation. Although it only holds in the dummy non-stochastic version of the 
problem, this approach encapsulates the core idea in this paper. 
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1.2 Synopsis 

In the stochastic case the optimal solution is an adaptive policy: one which selects a subset from 
X, and after the realization of its neighbors, selects an optimal solution with its remaining budget. 
Since adaptive policies are notorious in stochastic optimization for their difficulty, the standard 
approach is to design non-adaptive policies which approximate adaptive policies well. Informally, 
a non-adaptive policy is a policy which selects the subset S and a set of its neighbors, a priori to 
their realization. We cannot hope to obtain an approximation better than 1 — 1/e for the optimal 
non-adaptive policy unless P=NP CD, as the non-stochastic case is a special case. As we show 
later, the ratio between non-adaptive policies and adaptive ones can be as bad 1 — 1/e, and we 
therefore naturally seek algorithms whose approximation ratio is (1 —1/e)^. We do not know whether 
(1 — 1 /e)^ is the optimal approximation ratio, and it is one of the main open questions in this paper. 

Similar to the exposition in the warmup above, finding a (1 — 1/e) approximation to the optimal 
non-adaptive policy (and hence a (1 — 1/e)^ approximation to the optimal adaptive policy) reduces 
to finding e-blocks with arbitrarily good precision in the stochastic case. The key challenge in com¬ 
puting e-blocks here reduces to the following fundamental problem (which may be of independent 
interest) we call submodular-optimization-with-small-probabilities (SOSP): given a ground set of 
elements where every element appears with some small probability, find a set of expected size at 
most k that has the highest expected value over the realization of the elements. While for some 
classes of submodular functions near-optimal solutions for this problem can be obtained, arbitrarily 
good approximations for this problem are not likely to be obtainable in general. In other words, 
non-adaptive policies do not suffice for getting a (1 — 1/e)^-approximation algorithm. 

Our main result builds on an alternative strategy for defining e-blocks. Instead of non-adaptive 
policies we employ what we call locally-adaptive policies. Intuitively, a locally adaptive policy con¬ 
sists of a set of e-blocks, where within each block the policy can make adaptive decisions. The 
adaptivity within a block lets us hnd the optimal e-block and enables the (1 — 1/e)^ guarantee. 

1.3 Results 

• Tight Adaptivity gap. In Section [2] we show that non-adaptive policies approximate adap¬ 
tive policies within almost a factor of 1 — 1/e. We then show that this gap is tight. 

• Algorithm for SOSP. In section 12.11 we introduce this problem and show how it can be 
solved almost optimally for matroid-rank-sum functions (which include coverage functions) 
by convex programming as the probabilities vanish. We then show how to use this problem 
to design a (1 — l/e)^-approximation algorithm for this class of submodular functions. 

• Hardness of SOSP. We show that for general monotone submodular functions, the problem 
is hard to approximate to arbitrary precision by a reduction from the Planted-Clique 
problem. Thus, for general submodular functions computing the optimal e-block is also hard. 

• Our main result: A (1 — 1/e) ^-approximation algorithm through locally-adaptive 
policies. In Section [3] we describe our main algorithm designed for any monotone submodular 
function using value oracles. The algorithm finds a locally-adaptive policy whose value is 
guaranteed to be at least about 1 — 1/e of the value of the optimum locally-adaptive policy. 
Naturally, it remains to prove that the best locally-adaptive solution is 1 — 1/e away from the 
true optimum. We establish this by showing that any non-adaptive policy can be approximated 
arbitrarily well by a locally-adaptive policy, and utilizing the bounds we have for such policies. 
The idea of locally-adaptive policies is new and may be of independent interest. 

• Adaptivity gap for locally-adaptive policies. In section we exhibit a gap (~ 0.853) 
between the optimal locally-adaptive policy and the optimal adaptive policy. 


4 


1.4 Related work 


Adaptive Seeding. The Adaptive Seeding model was introduced by Seeman and Singer in [36]. 
They use a concave relaxation to achieve a constant bound approximation for the adaptive seeding 
problem with influence functions in the Triggering model. Unfortunately, their techniques do not 
extend to general submodular functions. In this work we introduce new non-adaptive and adaptive 
techniques for this problem, and achieve a better approximation bound for the Triggering model. 
Most importantly, our results hold for the entire class of submodular functions. In a follow-up 
paper [35| the adaptive seeding problem is studied under knapsack constraints. While the techniques 
used in that paper are applicable here, they give a different approximation bound than what we 
achieve here, which as shown in that paper is actually the appropriate bound for that case. However, 
as we show, in the cardinality case we can get better approximation bounds. 

Submodular Functions. Monotone submodular set functions maximization is an extensively 
studied problem. Nemhauser et al. [32| show a simple greedy algorithm that achieves a (1 — 1/e)- 
approximation for maximizing monotone submodular functions under cardinality constraints. 
Feige [13] shows that this is the best possible unless P=NP. A continuous version of the greedy 
algorithm was introduced by Calinescu et al. [6] to solve a multilinear extension of the problem 
which combined with the pipage rounding [T] techniques is shown to give a (1 — l/e)-approximation 
under matroid constraint for a special class of submodular function. Vondrak m shows that similar 
ideas can be used to give the same approximation for all monotone submodular functions. Checkuri, 
Vondrak, and Zenklusen [7] extend this framework and introduce Contention Resolution Schemes 
to show how to account for multiple matroid and knapsack constraints and also extend to non 
monotone functions. In this paper we use the contention resolution and a bound from [6] to bound 
the adaptivity gap between non-adaptive and adaptive policies. 

Submodular functions have been a crucial tool in the influence maximization literature. Kempe, 
Kleinberg, and Tardos [22] show that a class of influence models called Triggering model are all 
submodular functions and thus can be approximated by a greedy algorithm as discussed above. 
Mossel and Roch m proved a conjecture posted in |22j and show that far more general influence 
models can be expressed by a submodular function. These models have been studied in numerous 
papers both theoretically and empirically [a El El Ea ESI Ea EH], and thus are the main focus of 
this paper as well. 

Stochastic optimization. The adaptive seeding model is a stochastic optimization framework 
(see m for a survey). There has been extensive work on stochastic optimization problems in 
the context of approximation algorithm varying from two-stage with recourse minimization prob¬ 
lems ([l9],[2a, [33]) [MUSS] ) @2] ) of maximizing an objective under a budget constraint f [TO]. 
mm)- The adaptive seeding model is different from these problems as it combines a two stage 
model with a maximization under budget problem. Moreover, the constraint structure in adaptive 
seeding is such that the first stage decisions determine the available actions in the second stage (not 
just through the shared budget constraint). Another variant of multi-stage stochastic submodular 
maximization was studied by Asadpour et al. [3] and Golovin and Krause [16] . Despite the similar 
name, the adaptive seeding model is substantially different from these models, both in motivation 
and in the fact that the second stage problem is strongly dependent on choices made in the first 
stage. In these models the algorithm has access to the entire network but has the freedom to choose 
one node at a time (or group of nodes) and observe the realized value of these nodes. Yang et 
al. [33] and Chen et al. [9] also study “multi-level” models of the influence maximization problem, 
the latter partially inspired by the adaptive seeding model. However, their motivation, model and 
benchmark are substantially different from the adaptive seeding model, and apart from some special 
cases, their models do not have any constant factor approximations. 
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2 Non-adaptive policies 

A non-adaptive policy is a pair of sets (5, T) C X x M{X) where S represents the set selected in the 
first stage and T C AA(5') is the set selected in the second stage. The natural definition for feasibility 
would be that the policy selects at most k nodes, though it is easy to construct examples that show 
that such strict policies have an unbounded approximation ratio. We therefore consider relaxed non- 
adaptive policies whose guarantee is to select at most k nodes in expectation, where the expectation 
is over the randomization in the model, i.e. the probabilities of nodes arriving in the second stage. 
In the rest of the paper we drop the relaxed prefix and just call such policies non-adaptive policies. 
We define the value of a non-adaptive policy (S', T) to be FiT) = ^i) its 

cost as |S| +C{T), where C{T) = Yli&TPi- Finding the optimal non-adaptive policy requires solving 
the optimization problem: 

□PTjva = max{F(r) : |S| + C(r) < A:, S C X, T C AA(S)} . 

The crucial difference between these policies and adaptive ones is that non-adaptive policies fix a 
set T a priori to seeing the realization, whereas adaptive policies have the luxury of selecting a 
different set Tj for every realization Ri. Note that these policies are not a relaxation of adaptive 
policies nor are they a special case, since on the one hand they fix one set T but on the other hand 
are only restricted to the budget in expectation. Thus, it is not immediately clear they are useful 
for approximating adaptive policies. 

We first show that the adaptivity gap (i.e. the ratio between the value of the optimal adaptive 
policy and that of the non-adaptive policy) is exactly 1 — 1 /e. 

A tight 1 — 1/e adaptivity gap. Consider an instance with a single node in X connected to 
n = 1/(5^ nodes, each appearing with probability 6, for some small d > 0. The function is: 

f{T) = l^ ifr /0 

1^0 otherwise. 

For a budget of 2, an optimal adaptive policy seeds the single node in X, waits for the realization 
of its neighbors, and seeds whichever node realizes. The optimal non-adaptive policy here seeds the 
single node in X and spends the rest of its budget on l/d nodes in J\f{X), which has an expected 
utility of 1 — (1 — ~ (1 — 1/e). The adaptivity gap is therefore at least 1 — 1/e. 

We complement the above example by showing that the adaptivity gap is at most 1 — 1/e. 
The proof utilizes an interesting connection between the optimal adaptive policy and the concave 
closure of the underlying sub modular function [ 6 ]. Using this connection and properties of the 
concave closure from [ 6 ] we can bound the error of a fractional non-adaptive policy that we then 
round to get our result. 

Lemma 2.1. For any budget k, OPT^a > (1 — 1/e — 2/k)OPTA- 

Proof. Let S be the set chosen by the optimal adaptive policy and let Tj be the set chosen by this 
policy in realization i?*. For a set T C AA(S') let ax be the total probability that T is chosen by 
the adaptive policy in the second stage. That is, ax = '^i^{i\x=Ti}P(^i)- ^ T* be 

the probability that i is seeded over all realizations in the second stage. That is, q* = J2{x\i€T} 
Consider the function: 

/+(q)=max{ ^ /3T/(r)| ^/3 t = 1;/3t > 0 ; ^/^tIt = q}- 

^ TCAPS) T T 
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Obviously, ~ ^ ~ Thus a is a valid /3 for which /^(q) optimizes over. 

Since OPT^ = Yr^TfiT) this mean that /"'"(q) > OPT^- 

Consider instead the process in which at the second stage each element yi G AA(S') is chosen 
independently with probability q* and call the value of this process T(q). By a consequence of [ 6 l 
Lemma 5] we know that T(q) > (1 - l/e)/+(q) > (1 — l/e)DPT^. Now note that q* < p* as an 
item can’t be seeded with a higher probability than the probability it realizes. Thus, there exists 
t G [0, such that for every i G M{S) q* = t^pj. Since in each realization only A; — IS"! element 
are chosen, we know that t^p < A: — l^l. Thus, t can be understood as a fractional solution for the 
second stage set of the non-adaptive policy. We can round t using the pipage rounding [T] technique 
in order to get a vector with only one fractional solution and no loss in value. Note that we can 
assume without loss of generality that the fractional entry has the smallest marginal density out of 
all non-zero entries and that there are no fractional entries if t^p < k — [S'!. 

First assume that IIS'! < k/2. To get T we take the items for which the rounded vector entry is 
1. From submodularity and the fact that t^p > k/2 we get that the solution is a valid non-adaptive 
policy and has at most 2/k loss of the fractional solution. If |5| > A;/2 we can instead remove an 
item of S (and all the entries that are connected only to it) with the least marginal value and 
include the fractional entry in T (if it is not connected to that item). From submodularity, we can 
divide the value of the solution between the different items of S such that at least one of them has 
marginal value less than 2/k so we get a valid solution with at most 2/k loss. So we constructed a 
valid non-adaptive policy whose value is at least (1 — 1/e — 2/A:)DPT^ and thus get our result. □ 

We next show that a non-adaptive policy can be converted to an adaptive policy with small 
loss by using the contention resolution scheme [7|. 

Lemma 2.2. For every e G (0, 1/5) and for any non-adaptive policy {S, T) such that A: — IS"! > e~'^, 
there exist an adaptive policy with value > (1 — 2e) F{T). 

Proof. Given a solution (5, T) consider an adaptive policy which seeds the same set S in the first 
stage; in the second stage, for every realization R of neighbors of S the policy selects each node 
j G i? n r with probability (1 — e) into a set T and seeds the nodes in T if |r| > ^ = A: — |5| and 
otherwise does not seed any nodes. We next compute the probability of any element j to be seeded 
given that it is in T. 


Pr 


j is seeded | j G T 


= Pr 
= Pr 
= 1 — Pr 

> 1 — Pr 

> 1 — exp ( — 


|T| < e I j G T 


^-1 


(l-e)(?-Pi 

(l-e)(g-pj), ^ - 1 


■( 


(1 - e)(C - pj) 


T\{j}\ 

-If 


- 2 


> 1 — exp — 

V -3 

> 1 - exp (-e"^) 

> 1 - e 


We derive the first inequality from (1 — e)(.^ — pj) > E[|T \ {j}|]. Notice that since e < 0.5, we have 
that G (1, 2), and thus the second inequality follows form Chernoff bound (See appendix 
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for the exact bound used). The third inequality follows from simple arithmetic derivations. The 
fourth is from substituting ^ with and nsing the fact that e < 1/5. 

We therefore have that the probability j is seeded given that it is was realized (in K) is at least 
(1 — e)^ > (1 — 2e). We also have that the seeded set is always of size at most In addition for any 
element j and any two realizations Ri and i ?2 such that j S i?i T R 2 we have that the probability 
j is seeded when the realization is Ri is higher than if the realization is R 2 - These three condition 
define a monotone (1 — 2e)-balanced contention resolution scheme [7] and thus using the results 
from [7] we get that the expected value of this process (and thus of the adaptive policy) is at least 
(l-2e)T(T). □ 

Combining these results we can prove the following theorem: 

Theorem 2.3. For every e > 0, given an algorithm that finds a non-adaptive policy with value 
at least 'yOPTna, there is a {1 — l/e )7 — e approximation algorithm for the optimal adaptive policy. 

Proof. Rnn the non-adaptive algorithm to get a non-adaptive policy {S,T), with an approximation 
of 7 > 0 for DPTat^. First, assume that both k > 4/e and A: — ISI > (4/e)^. We can then use the 
same adaptive policy as in Lemma [2.21 with parameter e/4 and let Adapt{S, T) be the value of that 
policy. 


Adapt{S,T) > (1 - e/2) F{T) 

> (1 - e/ 2 ) yOPTwA 

> ((l-e/ 2 )(l-l/e-e/ 2 )) 70 PTA 

> ((l-l/e)7-e)0PTA 


where the first inequality is due to Lemma 12.21 and the third is due to Lemma O 

If A: — I S'! < c = (d/e)"^ we can iteratively remove from S the [c] elements that contribute the 
least to the value of the solution, as well as the elements of T that are connected only to the removed 
elements. Let S',T' be the result of this procedure. Notice that A: — IS"! > c. As we removed the 
elements with the least value we know that F{T') > (1 — -|^)T(T), and thus by the same argument 

as in the previous case we get that Adapt{S', T') > (1 — -|^)Adopt(S', T). It is easy to check that if 

k > 0 (-^) we still get the desired approximation ratio. 

If k does no satisfy one of the conditions above (so smaller then some constant) we can find an 
optimal adaptive policy by a brute force search over sets of size at most k in the first stage (we can 
approximate their value to any desired accuracy by sampling realization and finding the optimal 
second stage set). □ 

2.1 Optimization via Non-Adaptive Policies 

Given the blackbox reduction in Theorem 12.31 one can consider the problem of designing algorithms 
for non-adaptive policies. We now describe the simple greedy algorithm NonAdaptiveGreedy 
which is similar to the one sketched in the Introduction: at each step, as long as it doesn’t exceed 
the total budget, the algorithm adds the densest e-block. For non-adaptive policies, an e-block is 
a node x ^ X and a subset of its neighbors whose expected cardinality is at most 1/e. A formal 
description of the algorithms follows in Algorithm [TJ It assume a black-box access to an algorithm 
that finds an approximate optimal e-block called FindOptimalNonAdaptiveBlock. 

The next lemma shows that for any a > 0, a procedure which guarantees an er-approximation 
for the optimal e-block translates to a (1 — l/e“ — e)-approximation guarantee for the optimal 
non-adaptive policy. 


Algorithm 1 NonAdaptiveGreedy 
Input: / ; —)• M+, budget k. 

1: 5 ^ 0, q ^ 1?. 

2: while I S'! + Ylj^TPj < k — j do 

3: {x,B) FindOptimalNonAdaptiveBlock(S',T) 

4: {S,T) = {SUx,TU B) 

5: end while 
6: return (5, T) 


Lemma 2.4. Ve > 0, assume that in every iteration FindOptimalNonAdaptiveBlock returns 
a block which is an a-approximation to the optimal e-block. Then, when k = n(l/e^) Algorithm 
NonAdaptive returns a solution {S,T) such that F{T) > (1 — l/e“ — 0(e)) OPTna- 

Proof. Let {Sj,Tj) be the solution at the beginning of iteration j. For a block {x,B) let 
Ftj{B) ='^■^^.^^p{Ri){f{{Tj U B) n Ri) — f{Tj n Ri)). Let {xj,Bj) be the solution returned by 
FindOptimalNonAdaptiveBlock at iteration j and let {xo,Bo) be the optimal e-block in this 
iteration. First we observe that similarly to the non-stochastic case the optimal e-block is a (1 —2e)- 
approximation of the densest block. Consider an iteration j. For any block {x,B) we have that if 
C{B) < 1/e than the optimal e-block has at least the same marginal density. Otherwise, let be 
the set of elements of B of highest marginal value of cost at most 1/e. 

Ft,{B o) ^ Ft,{B,) ^ '^Ft,{B) ^ i-eFr^jB) ^ Ft,{B) 

l + C{Bo) - l + C{B,) - 1 + 1/e -l + eC{B)-^ 1 + G(.B) 

where the first inequality is because Bq is the optimal e-block and Bg. is a candidate, the second 
is because B,. is at least of size 1 /e — 1 and from the submodularity of the function, and the other 
steps are just simple algebra. 

We can think of the optimal non-adaptive solution as a set OCA and arbitrarily partition the 
nodes in Rf{0) such that for each node oi G O we associate a set of children Oi C M{oi). The cost 
associates with each node and its children is simply 1 +C(0£). Thus, we have that: 


FT,{Bj) ^ Ft,{B o) ^ Ft,{0,) ^ , EeFT,{0,) ^ g, , Ft,{0) 

> az — . > a (1 — 2e) max -— > er (1 — 2e) — > a (1 — 2e) 


l-\-C{Bj) \C{Bo) 


i 1 + C{Oi) 


Yhi 1 + c-{Oi) 


k 


where the second inequality is from the last equation, and the last is from submodularity. 
We proceed by induction to show that at any iteration we have that 


F(r,,.)>(l-n(l-a(l-2.)i±^) 


OPTna 


9 


















The base case is trivial. Now assume it is true for F{Tj). Then we have that 


+ « (1 “ 2 e) 
= ( 1 - a(l -2e) 


l + C{Bj] 


k 


1 + 




> 1 - a(l - 2e) 


k J 

1 + C (Bj 
k 


{OPTna - F{Tj)) 
F{Tj) + a (1 - 2e) 


k 




i=i 


1_JJ l_«(l_2e) 


l+C{Bi) \ 


+ a (1 - 2e) i^-^^DPTjvyi 


OPTna 


i=i 


1_ JJ l_«(l_2e) 


1 + CiB,) 
k 


DPTtva 


where the first inequality is from the previous derivation and the second inequality is by the 
induction hypothesis. 

After the last iteration the cost of the solution is greater than A: — ^ > (1 — 3e) k. Therefore, 
FiT) > OPTna 


e=i 


> 1 -( 1 - 


«(l-26)(l-3e),, 


k J 
OPTata 


> 1 - 


1 


go(l—5e) 


OPTata 


1 


— - 0{e)j OPT^a 

Where the second inequality is because setting all of the C{Bj) to be equal minimizes the function 
and we know that their sum is at least (1 — 3e) k. □ 


2.2 Finding optimal non-adaptive e-block 

Lemma 12.41 shows that finding good approximation for non-adaptive policies reduces to computing 
approximations for the optimal e-block. In this section we show that for some special cases we can 
find an optimal e-block and thus a (1 — 1/e) approximation for the optimal non-adaptive policy in 
polynomial time. Unfortunately, for general submodular functions we show it is unlikely that it can 
be approximated arbitrary well. 


2.2.1 Approximating optimal e-block for large probabilities 

In the first special case we consider, all the probabilities on nodes are larger than some constant. 
Here, the optimization problem is easy. Given some constant e > 0 the algorithm simply enumerates 
over all X e A and over all possible subsets of items T £ M{x) s.t. C{S) < 1/e. 

Corollary 2.5. Let 6 = miujgf^jpj. Then for any constant e > 0, we can approximate the optimal 
non-adaptive policy to within (1 — 1/e — e) in time poly(n^/^). 

In the rest of this section we turn to the more challenging task of seeding nodes with small 
probabilities. 
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2.2.2 Approximating the optimal e-block for MRS objective 

In case the probabilities are small enumerating over all possible solutions is computationally infea¬ 
sible. The problem of finding e-blocks reduces to the following fundamental problem. 

Definition 2.6. Submodular-optimization-with-small-probabilities-(5 (SOSP-S) problem: 
We are given a monotone submodular function f and probabilities of each element realizing p. The 
probabilities satisfy maxj p* < 6. Our goal is to find a set T of expeeted size k that maximizes the 
expeeted value of f. 

We’ll look at the fractional version of this problem in which the items of T are chosen indepen¬ 
dently with probabilities q (this can be thought of as a multi-linear relaxation [6] of SOSP). Since 
we are only interested in small values of 6 it is easy to round fractional solutions using the pipage 
rounding [T] technique with very small loss. Formally, we want to solve 


max E[/(r)] = ^ I JJqi JJ(1 - qi) j /(T) 

T \ieT i^T ) 

s.t. <k 

i 

qj G [0, Pi] Vi G [n] 

At a first glance, it may seem like no algorithm should be able to get an approximation better 
than 1 — 1/e for this problem: when 5 = 1 the problem identifies with submodular maximization 
under a cardinality constraint, and due to Feige we know that no algorithm can do better than 
1 — 1/e unless P=NP even for coverage functions |14j(even for the fractional version). It seems like 
shrinking the constraint polytope by a factor of 6 should not make a difference in the optimization. 
Surprisingly, as we next show, for submodular functions in a class known as matroid rank sum (which 
includes coverage functions), the above optimization problem can be solved nearly optimally. At a 
high level, we show that for such functions the problem can be well approximated through a convex 
program, which then enables us to produce a solution whose approximation becomes optimal as 6 
vanishes. H 

Theorem 2.7. Suppose that f can be represented as a matroid rank sum (MRS) funetion. Then, 
there exists a (1 — 5/2)-approximation algorithm for SOSP-d using eonvex programming. 

Proof. Suppose that we relax the program, so that the probability of seeding node z is 1 — e~'^' 
(but the cost remain the same). We can now optimize the following program: 

T \iGT i^T ) 

E 

i 

qi G [0, Pi] Vz G [re] 

Dughmi et al. m consider essentially the same program in the context of Poisson rounding and 
show that this program is concave when / is a MRS function. (They use it to achieve a (1 — 1/e)- 
approximation for general probabilities and do not consider the special case of small 5.) Thus, we 
can optimize this program (to within arbitrarily good approximation) in polynomial time. 

^Note that this is not due to the low cost of elements which allows for example for a greedy algorithm to be nearly 
optimal for the knapsack problem. 
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Observe that 1 — < q*. Therefore we only decreased the probability of seeding each node, 

so by monotonicity of /, our expected value will be at least as good as the solution of the new 
concave program. 

We lose at most a factor of (1 — e~^)/5 for any submodular function / (and in particular for 
matroid rank sum). Think of the process where the elements are added one by one, and consider 
the marginal contribution of each one. Let /r = ^ (q) denote the original distribution on sets (i.e. 
Pr^j[r] = OieTTi • ~ ^*))) l^t V = (q) denote the transformed distribution (i.e. 

V^iy\T] = niGT(^ “ l^t and Fi, denote the value of the objective function 

under each distribution. 

Fuici) = 

> 

> 


The first step follows by considering the expected increment for adding i, with respect to v. The 
second step follows by lower bounding and monotonicity. The third step follows by submodu¬ 
larity. Finally, the last step follows by again considering the expected increment for adding i, this 
time with respect to /i. □ 

Finally, we obtain a tight approximation for the non-adaptive solutions to adaptive seeding 
instances with MRS objective and arbitrary probabilities by carefully combining the two special 
cases. 

Theorem 2.8. For any e > 0 there is a polynomial-time algorithm that returns a (1 — 1/e — e)- 
approximation of the optimal non-adaptive policy for any matroid rank sum (MRS) function. 

Proof. Run Algorithm NonAdaptive with subroutine FindOptimalNonAdaptiveBlock that 
finds e'-blocks implemented as follows: Enumerate over all feasible subsets of nodes with probabil¬ 
ities at least 6. For each subset, let k' be the remaining budget for this block. Solve the concave 
program for budget in {e”, 2e'',... ,k'). By Lemma 12.71 when our enumeration reaches the optimal 
subset of large-probabilities elements, and we use the approximately correct additional budget (we 
spend at most an additional e budget), the solution of the concave program is a ((1 — 5/2) — e")- 
approximation to the densest subset. 

The concave program might return a solution that does not correspond to a set (have some q* 
that does not equal to 0 or pj). However, using the pipage rounding [T] technique the algorithm can 
round it (make q^ equals either pj or 0) to have at most one undetermined item without any loss 
of value. If such an item remains, the algorithm compares the density of the solution that includes 
that item to the solution that does not include it and chooses the one with maximum marginal 
density. It is easy to verify that one of those solution has a higher density than the density of the 
fractional solution. This procedure might cause the block to cost 5 more so in total 1 -|- 1/e^ -|- 5- 

We chose e', e" and 5 < e' (thus the total cost of a block is at most 3/e') such that the total loss 
due to Lemma Ea is e. For large values of k the theorem follows by Lemma 12.41 and by noticing 
that in the analysis of that lemma we only analyze iterations where there is at least 3/e' budget 
left so this procedure returns a valid block in each such iteration. If k is not large enough (smaller 
than some constant that depends on e), we can enumerate over all first stage set of size at most k 


X; (1 - e-«) [/ ({i} u (rn |i - 1 ])) - /(rn |i - 1 ])] 

i 

^ q.) [/ ({i} U (T n [i - 1])) - / (T n [i - 1])] 

q.) [/ ({^} u (T n [i - 1])) - / (T n [i - i])] 


E 

i : a€A. 


E 

i : a^Ai 

1 - e-^ 


5 

1-e-^ 

■FM) • 
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and complete the solution by solving a monotone submodular maximization on the second stage to 
get a (1 — 1/e) approximation. □ 


Combining this with the results of the previous section we get that for MRS functions we have 
a (1 — 1/e)^ approximation for the adaptive seeding problem as desired. 


2.2.3 Hardness of approximating optimal e-block for general submodular functions 

Unfortunately, for general submodular functions the Submodular-optimization-with-small- 
probabilities problem cannot be approximated arbitrary well even with a constant budget (and 
as a special case, computing optimal densest e-blocks is hard). Our algorithm heavily relies on the 
reduction to a concave program, yet as pointed out by nans], this program is not concave in gen¬ 
eral. While this means our current approach fails it does not mean the problem is computationally 
hard. 

Standard techniques for showing hardness of submodular maximization seem to fail for this 
problem: Feige’s construction m would also show hardness for the max-cover version, but we 
know that this problem is easy in this setting. The symmetry gap |42j should give an information- 
theoretic lower bound (in the oracle model), but with a constant budget it is easy to design an 
exponential-time, poly-information algorithm that achieves an arbitrarily good approximation. 

We therefor look for a construction where the optimal set behaves locally very differently from a 
random set. In other words, when we look at a random (l/e)-subset of the optimal set, it should be 
different from what we expect from a random (l/e)-subset somewhere else in the graph. Intuitively, 
this is very similar to the Planted-Clique problem, where every subset of the clique is of course 
also a clique, but the rest of the graph may be arbitrarily (constant) sparse [2]. 

Theorem 2.9. If the Submodular-optimization-with-small-probabilities problem with 
a constant budget k can he approximated within any constant factor better than 
(l — e“^/^) / (l — (I + l) then there is a polynomial time algorithm for the Planted- 

Clique problem that succeeds with high probability. In particular, for k = 1.7, 

(^1 - Q -h 1^ « 0.865 

Proof. We reduce from the Densest /-subgraph problem. In particular, Alon et al. [2] proved that 
for any constant e > 0, given a graph G = {V, E) it is Planted-Clique harcjl to distinguish 
between: 


• Completeness: G contains a clique of size /; or 

• Soundness: every /-vertex subgraph of G is e-sparse (i.e. it contains at most an e-fraction of 
the ( 2 ) edges an /-clique would contain.) 

Given G, we construct a monotone submodular function that gives 1 for every subset that contains 
an edge, and otherwise approaches 1 exponentially with the size of the subset: 


f{T) 


1 3 (tt, n) G FI s.t. {u, u} C T 

1 — otherwise 


We set the maximal probabilities such that a budget of k is sufficient to bid exactly for the entire 
/-clique: pi = k/l. Monotonicity is trivial. 

®See [ 2 ] for precise statement. 
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• Submodularity: Adding u to T may increase / by at most 2 However, adding u to any 
T' CT would increase / by at least 

Now, observe that regardless of the choice of q, the random variable |T| (the size of the realized 
set) behaves approximately like a Poisson distribution with parameter k. More precisely, the total 
variation distance between |T| and Pois (k) is bounded ^ q? < k‘^/1 < € (e.g. [25]). 

• Completeness: Let q°P* be 1 on the /-clique and 0 otherwise; then 


E[/(ropt)] ^ 1 

> 1 
= 1 


ipr[|r| = l]-Pr[|T|=0] 

- Pr [Pois (k) = 1] — Pr [Pois (k) = 0] — e 





— 6 


• Soundness: Suppose that every /-subgraph of G is e-sparse. Observe that by submodularity 
of /, we can assume w.l.o.g. that q®'^® defines a set, i.e. it is pi for / vertices that contain at 
most 6 ( 2 ) edges (and 0 everywhere else). Then, 


E 



<1-j;pr[|r| 

i=0 


i\2 * Pr[r contains an edge] 


For each of the 6 ( 2 ) potential edges, the probability that both vertices belong to T is (^^^) / ( 2 ) • 
Taking a union bound over all of them, we have 


Pr[T contains an edge] < e 



< ejTj^ 


Finally, since jT] is distributed e-like Pois [k), 


Pr[r contains an edge] < e • {k^ + k) + e 


Therefore, 


E 



< 1 


= 1 


= 1 

= 1 


Pr [Pois {k) = i]2 ® -|- 0(e) 

i=0 
°° hi 

•£ + 0(e) 

e-fc /2 + o(e) 


Thus, for budget k, it is Planted-Clique hard to find any approximation which is better than 
(1 — / (1 — (I-|- 1 ) e“^). For example, set k = 1.7 to get the approximation factor of 

(1 — / (1 — + 1 ) < 0.865. (Recall that since we have fractional costs, the budget 

may be fractional as well.) □ 
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This lower bound implies that the non-adaptive framework we use here cannot obtain the 
(1 — 1/e)^ approximation raticEI. This obstacle motivates our use of e-locally adaptive policies 
discussed in the following section. 

3 Approximation via e-locally-adaptive policies 

In this section we prove our main result: 

Theorem 3.1. For every constant e > 0 there is an algorithm that runs in polynomial time and 
returns an adaptive policy which is a ((1 — 1/e)^ — e)-approximation to the optimal adaptive policy 
with general monotone submodular functions. 

Proof outline. Our entire proof relies on the novel definition of a restricted class of adaptive 
policies which we call e-locally-adaptive. Informally, we say that a policy is e-locally-adaptive, if it 
can be divided into e-blocks. In this context, an e-block is a subset of X of constant size (for technical 
reasons these are not singletons as in previous cases), and for each realization an adaptively chosen 
set of constant size of its neighbors. We prove that a greedy algorithm that in each iteration finds 
the optimal e-block gives a (1 — 1/e —e)-approximation to the optimal e-locally-adaptive policy. This 
adaptive variant of e-blocks allows us to find the optimal subset for each realization (much in the 
same way as in the warmup presented in the introduction) and thus find the optimal block. Thus, 
while the non-adaptive block structure allows for greedy optimizations, the power of adaptivity 
within a block circumvents the hardness result of the previous section. We then prove that the 
optimal e-locally-adaptive policy is a (I — l/e)-approximation to the optimal adaptive policy by 
using the fact that locally-adaptive policies dominate non-adaptive policies. In particular, we show 
we can convert a non-adaptive policy to an e-locally-adaptive policy, with arbitrarily small loss in 
value, and thus our bound follows. A natural question is then whether locally-adaptive policies are 
as good as adaptive policies. We answer that question negatively by presenting an example that 
exhibits a gap (« 0.853) between the optimal locally-adaptive policy and the optimal adaptive 
policy. We conclude this expository subsection by formally defining e-locally-adaptive policies. 

Definition 3.2. An (adaptive) e-block is a set S X of size at most I/e^ and for each realization 
Ri a set Ti C M{S) H Ri of size at most 2/e. The cost of a block B is = |5| -|- maxj(|rj|). An 
e-locally-adaptive policy is a set B of (not necessarily disjoint) e-blocksU 

Let Ti^B be the set seeded by block B in realization Ri and let Ti{B) = We abuse 

notation and generalize the value and cost functions to be applied on these policies. That is, we 
let the value of such a policy be F{B) = f {TiiB)) and its cost C{B) = 

optimal e-locally-adaptive policy with budget k is then: 

QPJIa = max{F(^) : C(i3) < k,'dB G B : \Sb\ < l/e^^i : ^ M{Sb) n Ri, {R^bI < 2/e} 

3.1 The Algorithm 

We now describe the LocallyAdaptiveGreedy algorithm. We run a greedy algorithm that in 
each iteration adds a new e-block to the current solution. The algorithm always adds a block 
with an optimal marginal density, i.e. a block which maximizes the ratio between the expected 
marginal contribution and cost. A formal description of the algorithm is included below. The 

^Note that this does not exclude worse approximations guarantees via non-adaptive policies. In [35] non-adaptive 
policies are used to obtain a (poor) constant approximation guarantee under knapsack constraints. In this paper, we 
are interested in obtaining the (1 — 1/e)^ approximation bounds. 

®Note that the blocks are not necessarily independent - Ti can depend on the entire Ri and not only on Af{S)nRi. 
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FindOptimalAdaptiveBlock subroutine simply enumerates over all subsets of size less than 
1/e^ of X and all budgets of size at most 2/e and returns the pair with the highest marginal 
density. 

Algorithm 2 LocallyAdaptiveGreedy 
Input: budget k,e 

2 : while C{B) < A; — ^ do 

3: B Bu FindOptimalAdaptiveBlock {B, e) 

4; end while 
5: return B 


Polynomial-size representation. As there are possibly exponential many realizations we can’t 
hope to output a full explicit description of a locally-adaptive policy. Our algorithm instead outputs 
for each block its first stage set Si and a budget ki for it to optimize in the second stage as well as 
an order over the blocks. At every realization the policy seeds the second stage nodes by going over 
the blocks by order and optimizing the choices of each block given only the choices made by the 
previous blocks. Note that this implicitly determines the content of each block. In our algorithm 
we implicitly assume the order on the blocks of B is the order in which the algorithms adds them 
to B. Note that we can approximate F{B) and Fq{B) (the marginal value of block B for policy B) 
for such a policy to any desired accuracy by sampling realizations and running this process on each 
of them (we thus assume in the analysis that we have an oracle for their value). 


3.2 Analysis 

Lemma 3.3. For any e > 0, let B he the solution returned by LocallyAdaptiveGreedy with 
input k,e. Then, F{B) > ^1 — 1/e — O 

Proof. We hrst show that the marginal density of the e-block chosen in each iteration of the algo¬ 
rithm is at least the marginal density of the optimal locally-adaptive policy. Note that we always 
have enough budget left to add a full sized block. Let Bj be the solution at the beginning of iteration 
j and let Bj be the block added at iteration j with kj = maxi{\Ti^Bj\)- Let Bq denote an optimal 
solution with value DPT^^. For every iteration of the algorithm we have that: 


FbABo) 


hdBp iTABo)) 


k 


< 




< max 
B&Bo 


Y^BeBo 

-R fTi{Bj) 


<- 


-R^ 


C{B) 

fTi(Bj) {Ti,Bj) 


1 ‘S'jl + % 


Fb,{BA 

|5j| -I- kj 
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The first inequality is from the submodularity of /, and the third holds because the algorithm 
enumerates over all e-blocks as candidates in each iteration - including over the blocks of Bq- 
We proceed by induction to show that at any iteration we have that 


F(^,+i)> l-II 1 


i=i 


\Si\ + ki 
k 


DPT 


LA 


The base case is trivial. Assume it is true for F{Bj). Then we have that 


F(B,+i)>F{B,) + 


I'S'jl + kj 


(OPTIa - 


= 1 - 


> 1 - 




\Sj\+ kj 


+ 


1‘S'jl + kj 


OPT 


LA 


J -1 


i-nb 


1=1 


\Si\+ki 

k 


I S','! -I- kj 

OPTIa + ' ' OPTIa 


1-n 1 


1=1 


\Si\ + ki 


OPT 


LA 


where the first inequality is from the previous derivation and the second inequality is by the 
induction hypothesis. 

When the algorithm ends the cost of the solution is greater than A:' > (l — k. Therefore, 


F{B) > 11 - n (1 

J =1 


\Fj\ + kj 

k 


OPT 


LA 


> ( 1 - (1 - OPTIa 


> 1 - 


,k'/k 


OPT 


LA 




OPT' 


LA 


where the second inequality is because the solution’s cost is at least k' and this expression is 
minimized when the cost is evenly spread over the iterations. □ 

Adaptivity gap of e-locally-adaptive policies. We now show that e-locally-adaptive policies 
can arbitrarily approximate non-adaptive policies, which implies our bound. The high level idea 
is to show that there are good non-adaptive policies that have a block structure and thus can be 
converted to locally-adaptive policies. We first define the notion of e-local for non-adaptive policies: 

Definition 3.4. A budgeted e-block is a triplet (S,k,T) such that S F X is of size at most 
l/e^,i < A: < I and T C J\f{S) satisfies Ylj&TPj — e-local non-adaptive policy is a set £ of 

budgeted e-blocks. The cost of C is C{C) = ^bgc 

We now prove that a non-adaptive policy can be converted into an e-local non-adaptive policy. 
Let T{C) be the union of all of £’s blocks second stage sets. 

Lemma 3.5. For any non-adaptive policy {S,T) in which |5| -|-C(T) > 3/e^, there exists an e-local 
non-adaptive policy L of the same cost such that F{T{C)) > (1 — 3e)F(T). 
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Proof. Let k = |5| +C(T). Fix some order (xi .. .X| 5 |) on the elements of S. We say that a second- 
stage node y G AA(S') H T belongs to Xj if Xj is the smallest-indexed node in S with an edge to y. 
For a set Q ^ S, let 'Tt{Q) be the nodes that belong to nodes in Q. Iteratively construct the blocks 
of L by adding nodes from S into sets Si by the order (xi... X| 5 |), until either 

• 2/e > C(7^(5^)) > 1/e - set Ti = 7^(5*) and h = C(7^(5^)). 

• C{lZ{Si)) > 2/e - let be the maximal set of nodes of 7Z(x), where x is the last node added 
to Si, that can be added to Tl{Si \ x) such that the cost remains under 2/e (there is at least 
one such node since a node’s cost is at most 1); set Tj = TZ{Si \ x) U and ki = C{Ti)\ start 
the next set Sj+i again with x but ignore the nodes in <I>. 

• [S'/ = 1/e^ or we are done - set Ti = lZ{Si) and ki = 1/e. 

The first condition incurs no extra cost. The second condition applies at most [eC(r)] times 
and incurs an extra cost of 1 for the duplicated node, and in total at most eC(T) -|- 1. The third 
condition applies at most [e^lSI] times and it incurs an extra cost of at most 1/e, and in total at 
most els'! -|- 1/e. The total additional cost incurred is therefore at most eA:-|-l-|-l/e< 2ek. 

Iteratively remove from C the blocks whose marginal density is the lowest until C{C) < k. Their 
total cost at most 2ek -\-{1 /-\-2/e) < 2>ek. Thus, by submodularity, F{T{C)) > (1 — 3e)F(T). □ 

Our last step is to show that given an e-local non-adaptive policy, we can construct an e-locally- 
adaptive policy with almost the same value. The following lemma follows from Lemma 12.21 by 
analyzing each block of the policy independently. The e^ is needed to match the condition on the 
budget left for the second stage in that Lemma. 

Lemma 3.6. For any e such that e G (0,1/5), and for any e'^-local non-adaptive policy £ there 
exists an e'^-locally-adaptive policy B with value > (1 — 2e) F{T{C)). 

Putting it all together, we have that for every constant e > 0 there exist ei,€ 2 ,e 3 such that: 

F{B) > (1 - 1/e - ei) DPT)/^ > (1 - 1/e - eg) OPT^a > ((1 - l/ef - e) OPTa 

where the first inequality follows from Lemma 13.31 the second from Lemma 13.61 and the third from 
the results of the previous section. This completes our proof of Theorem 13.II □ 

3.3 Separation between e-locally-adaptive and adaptive policies 

We complement this section with an example that exhibits a gap between the values of the optimal 
e-local-adaptive policy and the optimal adaptive policy. It remains open whether the optimal gap 
is (1 — 1/e) as our upper bound suggests. Any better upper bound on the gap will immediately 
imply a better approximation bound for Algorithm LocallyAdaptiveGreedy. 

Lemma 3.7. There are instances where every e-locally-adaptive policy achieves at most ~ 0.853 of 
the value of the optimal adaptive policy. 

Proof. We construct an instance where it is advantageous to move large amounts of the budget after 
seeing the realizations. Since this is only possible in a fully adaptive (i.e. not e-locally-adaptive) 
policy, we obtain a separation between the two classes of solutions. 

Let m be a large parameter, and consider an instance where the first stage has |A| = m nodes. 
Each node x G A is connected to one special node yx ^ Y, and it also has regular neighbors 
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zlj. ^ Zx C. Z. The special nodes in Y realize with probability 1/m, while the regular nodes in Z 
realize with probability 1. Let the budget he k = m? + m + 1. Finally, consider the function: 

/(r) = i-n(i->nj,.|-^|rnz.T 

To see that / is indeed submodular, consider the following observation by Dughmi and Vondrak m- 
For any monotone submodular functions gi and 52 with values in [0,1], g{S) := 1 — (1 — 5i(5))(l — 
52 (<5)) is also monotone submodular. Applying this lemma recursively, we see that for any number 
of monotone submodular functions gi... gt with values in [0,1], g{S) := 1 — ni(l “ is also 

monotone submodular. Finally, / is submodular since it can be written in this form for coverage 
functions fa;{T) = ^\T n - ^\T n Z^\. 

Optimal adaptive: The optimal adaptive solution seeds all the first-stage nodes. With probability 
1 — 1/e, one of the special nodes yx realizes, in which case the optimal policy seeds as well 
as all the nodes Zx G Zx- In this case, fiT) = 1. With probability 1/e, none of the special 
nodes realize. In this case, the adaptive policy picks x arbitrarily, and seeds all the nodes 
Zx € Zx- In this case /(T) = 1/2. In expectation, E[/(T)] = 1 — l/(2e). 

e-Iocally adaptive: Given a realization, we say that a block Si is special if it contains a node with 
a realized special neighbor, and regular otherwise. Observe that in any locally adaptive policy, 
the probability that a block Si is special is less than l/(me^) by union bound. Therefore, the 
expected total budget ki allocated to special blocks Si is at most 0{mje^), i.e. negligible in 
comparison io\Zx\ = m^. Therefore any locally adaptive policy must spend all but a negligible 
amount of its budget on regular subsets. (It may still seed all the realized special nodes, but 
with only a negligible fraction of their neighbors). Let ^ denote the number of realized special 
nodes. Conditioning on we have, 

E1/ (O I £] < 1 - 2 -v (1 - ^|r n + 0(1). 

The optimal way to spend the budget on regular blocks is to pick some x arbitrarily, and seed 
(almost) all the nodes Zx G Zx, leaving sufficient budget to seed any realized special nodes. 
In particular, for any feasible T, 

xex ^ ^ xex 

Finally, notice that the distribution of ^ is approximately Pois(l). The expected value of the 
locally adaptive policy is therefore: 

E[/(r)] p 


Thus the ratio between the values of the policies is ~ 0.853. □ 


- 

1 - - • ^Pr [Pois(l) = i] 2“* 

i=0 
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A Chernoff bound 

We use the following bound in the paper (See for example [30l Theorem 4.4]). 

Theorem A.l (Chernoff bound). Suppose Xi,...,Xn are independent random variables taking 
values in [0,1] and let X denote their sum and be the expeeted value of X. Then, for 0 < <5 < 1 
we have that 

Pr[X > (1 + d)fi] < exp(-^) 
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